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The RNA genome of the hepatitis C virus (HCV) undergoes rapid 
evolutionary change. Efforts to control this virus would benefit from the 
advent of facile methods to identify characteristic features of HCV RNA and 
proteins, and to condense the vast amount of mutational data into a readily 
interpretable form. Many HCV sequences are available in GenBank. To 
facilitate analysis, consensus sequences were constructed to eliminate the 
overrepresentation of certain genotypes, such as genotype 1, and a novel 
package of sequence analysis tools was developed. Mutation Master generates 
profiles of point mutations in a population of sequences and produces a set 
of visual displays and tables indicating the number, frequency, and 
character of substitutions. It can be used to analyze hundreds of sequences 
at a time. When applied to 255 HCV core protein sequences, Mutation Master 
identified variable domains and a series of mutations meriting further 
investigation. It flagged position 4, for example, where 90% or more of all 
sequences in genotypes 1, 2, 4, and 5, have N4 , whereas those in genotypes 
3, 6, 7, 8, 9, and 10 have L4 . This pattern is noteworthy: L (hydrophobic) 
to N (polar) substitutions are generally rare, and genotypes 1, 2, 4, and 5 
do not form a recognized super family of sequences. Thus, the L4N 
substitution probably arose independently several times. Moreover, not one 
member of genotypes 1, 2, 4, or 5 has L4 and not one member of genotypes 3, 
6, 7, 8, 9, or 10 has N4 . This nonover lapping pattern suggests that 
coordinated changes at position 4 and a second site are required to yield a 
viable virus. The package generated a table of genotype-specific 
substitutions whose future analysis may help to identify interacting amino 
acids. Three substitutions were present in 100% of genotype 2 members and 
absent from all others: A68D, R74K, and R114H. Finally, this study revealed 
thatARFP, a novel protein encoded in an overlapping reading frame, is as 
conserved as conventional HCV proteins, a result supporting a role for ARFP 
in the viral life cycle. Whereas most conventional programs for 
phylogenetic analysis of sequences provide information about overall 
relatedness of genes or genomes, this program highlights and profiles point 
mutations . This is important because determinants of pathogenicity and drug 
susceptibility are likely to result from changes at only one or two key 
nucleotides or amino acid sites, and would not be detected by the type of 
pairwise comparisons that have usually been performed on HCV to date. This 
study is the first application of Mutation Master, which is now available 
upon request (http://tandem.biomath.mssm.edu/mutationmaster.html) . 
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Many viruses have overlapping genes and/or regions in which a nucleic 
acid signal is embedded in a coding sequence. To search for dual -use 
regions in the hepatitis C virus (HCV) , we developed a facile 
computer-based sequence analysis method to map dual -use regions in coding 
sequences. Eight diverse full-length HCV RNA and polyprotein sequences were 
aligned and analyzed. A cluster of unusually conserved synonymous codons 
was found in the core-encoding region, indicating a potential overlapping 
open reading frame (ORF) . Four peptides (Al, A2 , A3, and A4) representing 
this alternate reading frame protein (ARFP) , two others from the HCV core 
protein, and one from bovine serum albumin (BSA) were conjugated to BSA and 
used in western blots to test sera for specific antibodies from 100 chronic 
HCV patients, 44 healthy controls, and 60 patients with non-HCV liver 
disease. At a 1:20,000 dilution, specific IgGs to three of the four ARFP 
peptides were detected in chronic HCV sera. Reactivity to either the Al or 
A3 peptides (both ARFP derived) was significantly associated with chronic 
HCV infection, when compared to non-HCV liver disease serum samples (10/100 
versus 1/60; p < 0.025). Antibodies to A4 were not detected in any serum 
sample. Our western blot assays confirmed the presence of specific 
antibodies to^a new HCV antigen encoded, at least fin part, in an alternate 
reading frame (ARF) overlapping the core-encoding region. Because this 
novel HCV protein stimulates specific immune responses, it has potential 
value in diagnostic tests and as a component of vaccines. This protein is 
predicted to be highly basic and may play a role in HCV replication, 
pathogenesis, and carcinogenesis. 
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Hepatitis C virus (HCV) is an important human pathogen that affects 
approximately 100 million people worldwide. Its RNA genome codes for a 
polyprotein, which is cleaved by viral and cellular proteases to produce at 
least 10 mature viral protein products. We report here the discovery of a 
novel HCV protein synthesized by ribosomal frameshift. This protein, which 
we named the F protein, is synthesized from the initiation codon of the 
polyprotein sequence followed by ribosomal frameshift into the -2/ + 1 
reading frame. This ribosomal frameshift requires only codons 8-14 of the 
core protein-coding sequence, and the shift junction is located at or near 
codon 11. An F protein analog synthesized in vitro reacted with the sera of 
HCV patients but not with the sera of hepatitis B patients, indicating the 
expression of the F protein during natural HCV infection. This unexpected 
finding may open new avenues for the development of anti-HCV drugs. 
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